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Abstract 

The paper is devoted to quantization of extensive games with the use of both the 
Marinatto- Weber and the Eisert-Wilkens-Lewenstein concept of quantum game. 
We revise the current conception of quantum ultimatum game and we show why 
the proposal is unacceptable. To support our comment, we present the new idea 
of the quantum ultimatum game. Our scheme also makes a point of departure for 
a protocol to quantize extensive games. 

1 Introduction 

During the last twelve years of research into quantum games the theory has been already 
extended beyond 2x2 games. Since majority of noncooperative conflict problems are 
described by games in extensive form, it is interesting to place extensive games in the 
quantum domain. Although there is still no commonly accepted idea of how to play 
quantum extensive games, we have proved in [I] that it is possible to use the frame- 
work |2j of strategic quantum game to get some insight into quantum extensive games. 
Namely, we have shown that a Hilbert space J^ 5 = C 2 £g>C 2 £g>C 2 , a unit vector \ipm) £ ^ 
the collection of subsets -,=1,2,3 of SU(2), and appropriately defined functionals E\ 
and E2 express the normal representation of a two stage sequential game. Moreover, it 
allows to get a result inaccessible in the game played classically. In this paper, the above- 
mentioned quantum computing description will be used to the two proposed variants of 
the ultimatum game [5]. It is a game in which two players take part. The first player 
proposes one of two proposals how to divide a fixed amount of good. Then the second 
player either accepts or rejects the proposal. In the first case, each player receives the 
part of goods according to player l's proposal. In the second case, the players receive 
nothing. A game-theoretic analysis shows that player 1 is in a better position. Since 
player 2's rational move is to accept each proposal, player l's rational move is to make 
the best proposal for her. As we will show in this article, the Eisert-Wilkens-Lewenstein 
(EWL) approach pQ as well as Marinatto- Weber (MW) approach [7] can change the 
scenario of the ultimatum game significantly improving the strategic position of player 
2. Our paper also provides an argument indicating that the previous idea [8] of quantum 
ultimatum game is not sufficient to describe the game in the quantum domain. We will 
explain that, in fact, the formerly proposed protocol does not quantize the ultimatum 
game but another 2x2 game. The last part of the paper is devoted to a form of a game 
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tree where we provide the procedure how to determine the game tree when the game is 
played according to the MW approach. 

2 Preliminaries to game theory 

Definitions in the preliminaries are based on [TT| . This section starts with a definition 
of a finite extensive game. 

Definition 2.1 Let the following components be given. 

• A finite set N = {1, 2, . . . , n} of players. 

• A set H of finite sequences that satisfies the following two properties: 

1. the empty sequence $ is a member of H ; 

2. if (a k )k=i,2,...,K G H and K > 1 then {a k )k=i,2,...,K-i G H. 

Each member of H is a history and each component of a history is an action taken 
by a player. A history (a\, a 2 , . . . , a#) G H is terminal if there is no ax+i such 
that (ai, a%, . . . , ax, olk+x) G H. The set of actions available after the nonterminal 
history h is denoted A(h) = {a: (h,a) G H} and the set of terminal histories is 
denoted Z. 

• The player function P: H \ Z — > N U {c} that points to a player who takes an 
action after the history h. If P(h) = c then chance (the chance-mover) determines 
the action taken after the history h. 

• A function f that associates with each history h for which P{h) = c an independent 
probability distribution f(-\h) on A[h). 

• For each player i G iV a partition Xj of {h G H\Z : P{h) = i} with the property that 
for each Ii G Xj and for each h, hi G lj an equality A(h) = A(h') is fulfilled. Every 
information set Ii of the partition corresponds to the state of player's knowledge. 
When the player makes move after certain history h belonging to Ii, she knows 
that the course of events of the game takes the form of one of histories being part 
of this information set. She does not know, however, if it is the history h or the 
other history from Ii . 

• For each player i G N a utility function Ui m . Z — > R which assigns a number 
(payoff) to each of the terminal histories. 

A six-tuple (N, H, P, f, {Ii}, {ui}) is called a finite extensive game. 

Our deliberations focus on games with perfect recall (although Def. 12. II defines extensive 
games with imperfect recall as well) - this means games in which at each stage every 
player remembers all the information about a course of the game that she knew earlier 
(see [9] and [11] to learn about formal description of this feature). 

The notions: action and strategy mean the same in static games, because the players 
choose their actions once and simultaneously. In the majority of extensive games a player 
can make her decision about an action depending on all the actions taken previously by 
herself and also by all the other players. In other words, players can make some plans of 
actions at their disposal such that these plans point out to a specific action depending 
on the course of a game. Such a plan is defined as a strategy in an extensive game. 
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Definition 2.2 A pure strategy Sj of a player i in a game (N, H, P, f c , {Xj}, {ui}) is 
a function that assigns an action in A(Ii) to each information set Ii G X. 

Like in the theory of strategic games, a mixed strategy U of a player i in an extensive game 
is a probability distribution over the set of player z's pure strategies. Therefore, pure 
strategies are of course special cases of mixed strategies and from this place whenever we 
shall write strategy without specifying that it is either pure or mixed, this term will cover 
both cases. Let us define an outcome O(s) of a strategy profile s — (si, S2, ■ ■ ■ , s n ) in an 
extensive game without chance moves to be a terminal history that results when each 
player i G N follows the plan of Sj. More formally, O(s) is the history (ai, a 2 , . . . , ax) G Z 
such that for < k < K we have Sp( ftl)<l3) ... i0fc )(ai, a 2 , . . . , a^) = a k+1 . If s implies a history 
that contains chance moves, the outcome O(s) is an appropriate probability distribution 
over histories generated by s. 

Definition 2.3 Let an extensive game T = (N, H, P, {Ii}, {ui}) be given. The normal 
representation of T is a strategic game (N, {Si}, {u'j}) in which for each player i G N: 

• Si is the set of pure strategies of a player i in T; 

• K : YlieN Si ~R defined as u'^s) := Ui(0(s)) for every s G YlieN an( ^ * e ^ ■ 

One of the most important notions in game theory is a notion of an equilibrium intro- 
duced by John Nash in [10]. A Nash equilibrium is a profile of strategies where the 
strategy of each player is optimal if the choice of its opponents is fixed. In other words, 
in the equilibrium none of the players has any reason to unilaterally deviate from an 
equilibrium strategy. A precise formulation is as follows: 

Definition 2.4 Let (JV, Si, {ui} ie iy) be a strategic game. A strategy profile ■ ■ ■ j C) 

is a Nash equilibrium (NE) if for each player i G iV and for all Sj G Si: 



A Nash equilibrium in an extensive game with perfect recall is a Nash equilibrium of 
its normal representation, hence Def. 12.41 applies to strategic games as well as extensive 
ones. 

3 The ultimatum game 

The ultimatum game is a problem in which two players face a division of some amount € 
of money. The first player makes the second one a proposal of how to divide € between 
them. Then the second player has to decide either accept or reject that proposal. The 
acceptance means each player receives a part of € according to the first player's proposal. 
If the second player rejects, each player receives nothing. Let us consider the a variant 
of the ultimatum game in which player 1 has two proposals to share €!: a fair division 
Uf = (€1/2, €1/2) and unfair one u u = (5€, (1 — where the 5 is a fixed factor such 

that 1/2 < 5 < 1. This problem is an extensive game with perfect information that 
takes the form: 




where t*_ , = (t{, . . . , t* t _ v t* +1 , . . . , t* n ). 



(1) 




(2) 



with components defined as follows: 



• H = {0, c , ci, (c , d ), (c , di), (a, e ), (ci, ei)}; 
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• P(0) = 1, P(c ) = P(ci) = 2; 

. X 1 = {0} ) X 2 = {{(c o )},{(c 1 )}}; 

• n(c ,rf ) = (€/2,€/2), u (c a , e ) = (6€, (1 - <T)€), 
u(co,di) = w(ci,ei) = (0,0). 

The extensive and the normal representation of r x is shown in Figure HJ Equilibrium 
analysis of the normal representation gives us three pure Nash equilibria: (co,<io e i)) 




Figure 1: A two proposal ultimatum game Ti: an extensive form a) and a normal 
form b). 

(ci, G?o e o) and (ci, c?ieo). There are also mixed equilibria: a profile where player 1 chooses 
c and player 2 chooses (i eo with probability p < 1/(25) and d ei with probability 1—p, 
and a profile where player 1 decides to play c\ and player 2 chooses any probability 
distribution over strategies d^e® and d\eo. However, we can put these ones aside since 
both mixed equilibria do not contribute to the utility outcomes of IY They generate the 
same utility outcomes as the pure ones: (€/2, €/2) and (<5€, (1 — 5)€), respectively. The 
key feature that make the game Ti so curious is that only equilibrium profile (ci,d e ) 
with unfair outcomes (S€, (1 — S)€) is a reasonable scenario among all the equilibria 
of the ultimatum game (many experiments show that people are inclined to choose fair 
division (€!/2,€!/2), however we stick to the natural assumption of game theory that 
players are striving to maximize their payoffs). The strategy combination (ci,<io e o) 
is the unique equilibrium that is subgame perfect (the idea of subgame perfection is 
the well-known equilibrium refinement formulated by Selten [13]) i.e. it is a profile of 
strategies that induces a Nash equilibrium in every subgame (there are three subgames 
in Ti: the entire game, a game after the action Co and a game after the action c±). At 
the same time the subgame perfection rejects equilibria that are not credible. Let us 
consider the profile (c ,do e i)- Here, the strategy d ei of player 2 demands the action 
ei when player 1 chooses c\. However, when c\ occurs, a rational move of player 2 
is eo- Similar analysis shows that also (ci,dieo) is not subgame perfect equilibrium. 
Although the notion of subgame perfection is related to the extensive form of a game, 
we can easily determine subgame perfect equilibria in any two stage extensive game with 
perfect information (or even in a wider class of extensive games) through an analysis 
of its normal representation. In the game Y\ an action taken by player 1 determines a 
subgame in which only player 2 makes a move. Thus subgame perfect equilibrium in Y\ 
is a Nash equilibrium with a property that a strategy of player 2 is the best response to 
every strategy of player 1 (i.e., a strategy that weakly dominates the others). 
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4 Criticism of the previous approach to 
quantum ultimatum game 

A misrepresentation of the classical ultimatum game is the source of its incorrect quan- 
tum representation in [8] . The author describes the ultimatum problem as a 2 x 2 game 
and then applies the MW and the EWL schemes to construct the quantum game. How- 
ever, as we have seen in Figure [Tb, 2 x 4 is a minimal dimension allowing to represent 
the ultimatum game in normal form. A hypothetical case of the ultimatum game in 
which player 2 has only two strategies after an action taken by player 1 implies that 
player 2 is deprived of capability to make her move conditioned on the action of the 
first player. That is tantamount to an event where the players take their actions at the 
same time or one of the players chooses her action as the second but she does not have 
any information about an action taken by her opponent. It does not correspond to a 
description of the ultimatum game where the second player knows a proposal of her 
opponent and depending on the move of the first player she makes her action. Although 
the player 2 has only two actions: accept or reject in the two-proposal ultimatum game, 
in fact she has four pure strategies defined as her plans of an action at each of her 
information sets. Therefore, a 2 x 2 strategic game cannot depict the ultimatum game. 
Consequently, the MW and the EWL approach used for quantization of a 2 x 2 game 
cannot produce a quantum version of this game. Neither of these quantum realizations 
contains the classical ultimatum game. 

5 The quantum ultimatum game obtained by quan- 
tization of the normal representation of the clas- 
sical game 

First, let us remind the protocol for playing quantum games defined in [3]. It is a 
six-tuple: 



where the components are defined as follows: 

• is a complex Hilbert space <S)j=i ^ 2 with an orthonormal basis B. 

• N is a set of players with the property that \N\ < m. 

• |^i n ) is the initial state of a system of m qubits \<fi), \(f2), ■ ■ ■ , \<£m)- 

• £ : {1,2,..., m} — > N is a surjective mapping. A value £(j) indicates a player who 
carries out a unitary operation on a qubit \ipj). 

• For each j G {1,2,..., m} the set lAj is a subset of unitary operators from SU(2) 
that are available for a qubit j. A (pure) strategy of a player i is a map Tj that 
assigns a unitary operation Uj G Uj to a qubit \fj) for every j G The final 
state iV'fin) when the players have performed their strategies on corresponding 
qubits is defined as: 



r QI = (^,AgvO,U^},{£ 8 }) 



(3) 



l^fin) := (ti,t 2 



•••,T„)|Vin)=(g) ^#in). 



(4) 
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• For each % G N the map Ei is a utility (payoff) functional that specifies a utility 
for the player i. The functional Ei is defined by the formula: 

Et=J2 v i(P)\( b \M 2 , where Vi (b) G R. (5) 
\b)eB 



The above scheme is adapted for extensive games with two available actions at each 
information set so that we could use only qubits for convenience. Any game richer 
in actions can be transferred to quantum domain by using quantum objects of higher 
dimensionality. 

The idea framed in [1] bases on identifying unitary actions taken on a qubit with 
actions taken in an information set of classical game. Therefore, three qubits are required 
to express the ultimatum game in quantum information language. Since the first player 
has one information set and the second player has two ones, player 1 performs a unitary 
operation on only one qubit and player 2 operates on the rest. Like in [5] we examine 
the two approaches: the MW approach and the EWL approach to quantizing I\. 

5.1 The MW approach 

Let us consider the following six-tuple: 

If w = (JP e , {1, 2}, |^>, £, {{a , ax}*}, {Ei}) , (6) 

where: 

• Jif c is a Hilbert space ®j=i ^ 2 with the computational basis states \xi,x 2 , x 3 ), 
Xj = 0, 1; 

• the initial state |-?/>m) is a general pure state of three qubits: 

l^in) = 2j ^\x), where X x G C and ^2 \^x\ 2 = M ( 7 ) 

a;G{0,l} 3 ze{0,l} 3 



the map £ on {1, 2, 3} given by the formula: 



ao = ( I 1 ) and ai = ( 1 I 
the payoffs functionals Ei, i = 1,2, are of the form: 



1, if j = i; 

2, if j'G {2,3}. 



El = l € E I < 00 ' ^IV'fin) | 2 + 5€ ^ I (1, X 2 , 0|^ fin 



^ = ^^|(00,a;3|^ fi n)r + (l-5)€^|(l,x 2 ,0|^ fin )p 



2 

^3 ^2 



By definition of £ in Ii, player 1 acts on the first qubit and treats the operators o-q 
and a\ as her strategies. Player 2 acts on the second and the third qubit, hence her 
pure strategies are o\ (8) ctq, <Jq <8> cf , of ® erg and of <g) of (the upper index denotes a 
qubit on which an operation is made). Let us determine for each profile (cr^, (cr^ 2 , cL)), 
where ki, k 2 , K3 G {0, 1}, the corresponding expected utility Ei by using formulae ©-© 
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P2 



OqIEIOq dJXlai of®ojj (7j® CTj 



(MWK + (M 2+ M 2 )"» 




(I^p+I^PK+(M 2 +M> u 




(M^P^ + ^P+iyX 


(l^pH^p^+^p+I^PK 




(|x 6 p+|x 7 p) % +(|x,p+|A 3 pK 



Figure 2: The MW approach to the normal representation of Ti. 

and the specification of <Q. We illustrate it using as an example Ei (<7q, (of , Oq)) for 
i — 1,2. The initial state after the players choose the profile (<Tq, (of, (Tq)) takes the form 
l^fin) = o"o ® a i ® o'olV'in)- Thus, we have: 

IV'fln) = X] ^i,X2,x 3 kl,^2,2;3), (9) 

£i,£2,:E3G{0,l} 

where £2 is the negation of x 2 . Putting the final state (Q into the first of Eq. ([8]) we 
obtain: 

E x (o- \ (a?, a 3 )) = ^€ (|A io| 2 + |A n| 2 ) + 5€ (|A 100 | 2 + |A 110 | 2 ) . (10) 

Obviously, we have (1 — 5)€> instead of in the expected utility E 2 . Therefore, the 
payoff vector (Ei,E 2 ) is Uf (|Aoio| 2 + |Aon| 2 ) + w u (|Aioo| 2 + |Aho| 2 ) in that case. Payoff 
vectors (Ei,E 2 ) for all possible profiles {cr l Ki , (c 2 2 ,cTk 3 )) are placed in the matrix repre- 
sentation in Figure H] (for convenience we convert binary indices (xi,X2,x 3 ) 2 of \ Xl ,x 2 ,x 3 
to the decimal numeral system). 

Let us examine the game in Figure [2] to answer to what degree passing to the quantum 
domain may influence the result of the game. Notice first that ([6]) is indeed the quantum 
game in the spirit of the MW approach - the normal representation of Ti can be obtained 
from r^ w by putting |A | 2 = 1 and \X X \ 2 = for x = 1, 2, . . . , 7, i.e., if we put \ip in ) = 
1 000) . More generally: r^ w coincides to a game isomorphic to the normal representation 
of Fi if we put as \ip in ) = \xi, x 2 , x 3 ) any basis state. Then r^ w is equal to Ti up to the 
order of players' strategies. The game T\ favors player 1 as we have learnt in Section |3] 
Thus, an interesting problem is to look for another form of the initial state ([7]) that 
imply fairer solution unavailable in the game Fx. Let us study first: 

l^im) = \ (looo) + |001) + |100) + |110» . (11) 

Through the substitution |Ao| 2 = |Ai| 2 = | A4 1 2 = | Ae | 2 = 1/4 (the other squares of the 
moduli equal 0) to entries of the matrix representation in Figure [2] we obtain a game 
where the only reasonable equilibrium profile is <Jq(£><Jq®<Jq with corresponding expected 
utility vector E = (Ei, E 2 ) equal (uf + u u ) /2. The other pure equilibria: a\ £g> (Jq <S> crf 
and a\ ® a\ ® a\ - both generating the utility outcome (uf + u u ) /4 are obviously worse 
for both players so they won't be chosen. Moreover, a\ £g> cTq ® <Jq is an imitation of 
a subgame perfect equilibrium - the strategy of the second player a\ <g> <Tq is the best 



response to any strategy of the first player. To sum up, the initial state (TTTj) is beneficial 
to player 2 compared with the classical case. It turns out that the answer to the question: 
is there any |^ in ) allowing to obtain a fair division of €!, is also positive. Let us consider 
any state of the form: 



1^2} = 1/^1000} + Jl_-L|001), where ~ < 6 < 8' < 1. (12) 
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Once again the profile erg <S> cx 2 <g> ctq constitutes a Nash equilibrium and the strategy of 
the second player a 2 <8> o"q weakly dominates her other strategies as a result of putting 
| Ao | 2 = 1/25' and |Ai| 2 = 1 — 1/25' in the game in Figure [2j Since there are no other 
profiles with that property, <Jq £§> a 2 <g> <7q is the most reasonable scenario that implies 
E lt2 {(7Q g) (Tq <g> Ctq) = €J/2. The superposition of the third qubit (the second qubit of 
player 2) is essential to obtain fair division result since it is impossible to achieve 5€ by 
player 1 then. Therefore, the payoff €/2 becomes the most attractive for her now. 

The conclusions we can draw from the analysis of the MW approach to the ultimatum 
game are as follows. First, the game r^ w that begins with iVw) discloses a game tree 
different from the one in Figure [T^,). If there is a protocol for quantizing the extensive 
game T\ directly without using its normal representation as in our case, then the output 
game tree must be different from the game tree of Ti in general. It follows form the fact 
that the game tree in Figure [T^i with any four utility outcomes assigned to its terminal 
histories implies the normal representation specified by only these four payoff outcomes. 
However, the game r^ w where the initial state take the form of (II ip has five different 
outcomes. Notice, that is not irrelevant issue bearing in mind the fact that the bimatrix 
of a strategic game played classically as well as played by the MW protocol always have 
the same dimension. 

The case where game begins with the state ( lT2"j) is applied shows that even a separable 
initial state can influence significantly a result of Ti. It is not strange property. Any 
superposition of a player's qubit causes some limitation on players' influence on their 
qubits as we have seen in the case (1TT]1 . In particular, if each qubit of the initial 
state is in the state |+) = (|0) + |1)) /a/2, no player can affect amplitudes of her qubit 
applying only cr and a\ (measurement outcomes and 1 on qubit occur with the same 
probability). Then the result of the game only depends on the initial state |?/>i n ) = 



5.2 The EWL approach 

As we have seen, the two-element set of unitary operators is too simple in some cases. 
The two-parameter unitary operations used in the EWL protocol allow to avoid player's 
powerlessness when she acts on |+), and generally each player can essentially affect 
amplitudes of the initial state. Thus, it is interesting to find a result of the ultimatum 
game played according to the EWL approach. Let the following six-tuple be given: 

rf WL = (jr., {i, 2}, i^ooo), e, {{u(o, ffli}, m) , (13) 

where: 

• J%1 is a Hilbert space ®j=i ^ 2 with the basis {\if) xi ,x 2 ,x 3 ) '■ %j = 0, 1} of entangled 
states defined as follows: 

v _ \xi,x 2 ,x 3 ) + z|xi,x 2 ,x 3 ) _ 

• the mapping £ is the same as in six-tuple (fU); 

• the player's actions {U(9,(3): 9 G [0,7r],/3 e [0,7r/2]}, studied, for example, in the 
paper |3j, form an alternative to two-parameter unitary operations used in pQ. 
They are of the form: 

/ cos(£/2) ^sin(0/2)\ n ^ 
U W>P>- I ie-tfsin(0/2) cos(#/2) i' [ ^ } 



S 



Ei for i = 1, 2 are the payoff functionals (jHJ) defined for the basis (flllk 



J-:i 



■)-■> 



(16) 



J' 3 



•1'2 



Each strategy {7i of player 1 is simply {7(0i,ft). The strategies of the second player are 
chosen in a manner similar to r^ w - they are tensor products U 2 <8> C/3 = {7(02, ft) <8> 
^7(03> ^3). The final state IV'fin) corresponding to a profile r = ((#i, ft), (6*2, ft, 6*3, ft)) is 
as follows: 



iV'fin) = U x ® U 2 ® U 3 \ip oo) = -j= ^2 v *\ x ) 



(17) 



xe{o,i} 3 



where 



n 



I |_cos 

3 



Xj 7T - 6j 



n 



COS 



^•7T - 6j 



18) 



and 7 = 1, 2, 3, Xj = 0,1, and is negation of Xj. Putting fll6p and (I17p into formula 
(JSJ) we obtain the following expected payoff vector (Ex(t), E 2 {t))\ 

(E 1 (t),E 2 (t)) = 



2 #1 2 ^2 / 2 ^3 . . 2 ^3 2 o 

cos — cos — cos h sin — COS ft 

2 2 V 2 2 y 



0(0 
+ sin 2 -± sin 2 sin 2 sin 2 (ft + ft + ft) + cos 2 sin 2 (ft + ft; 



+ «u 



2 2 
0i 



03 



2 w l 2 p 3 / 2 p 2 

sin — cos — 
2 2 



cos 2 y cos 2 ft + sin 2 y cos 2 (ft + ft) 



9 01 2 03 I 2 02 2 / o /on 2 02 2 n 

+ cos —sin — I sin — sm (ft + ft) + cos — sin ft 



(19) 



Let us check first that rf WL generalizes the classical ultimatum game T\. Pure strategies 
of the first player are represented by {7(0, 0) and U(tt, 0). Similarly, the set of strategies 
of the second player in T 1 is represented by a set {{7(0 2 ,O) <g> {7(0 3 ,O): 2 , 3 G {0,7r}} 
since the set of profiles 



{((0 1 ,O),(02,O,0 3 ,O)):0 1 ,02,0 3 e{O,vr}} 
in f lT3|) and the set of profiles 

{(c kl , d k2 e ks ) : k u k 2 , k 3 E {0, 1}} 



(20) 



(21) 



in 02]) generate the same payoff's. Equivalents of behavioral strategies of Ti (i.e., inde- 
pendent probability distributions p, q and r over the actions c kl , dk 2 and e ka , respec- 
tively, specified by players at their own information sets) can be found among unitary 
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strategies as well. If we restrict unitary actions to {7(0,0), i.e., to profiles of the form 
((0i, 0), (0 2 , 0, 3 , 0)), 6j E [0, vr], the right-hand side of Eq. (USD takes the form: 

2 2 ^2 . 2 ^l 2 03 / 00 n 

Mfcos —cos — + M u sm —cos — . (22) 

By substituting p for cos 2 (0i/2), q for cos 2 (02/2), and r for cos 2 (03/2) we get the expected 
payoffs corresponding to any behavioral strategy profile ((p, 1 — p), {{q, 1 — q), {r, 1 — r))) 
in rV 

Let us examine an impact of the unitary strategies on a result of the EWL ap- 
proach to IY In particular we ask the question if the unfair division u u or the fair 
division Uf in rf WL is more probable. Notice, that the profile ((0 X , ft), (9 2 , ft, 03, ft)) = 
((7r, 0), (0, 0, 0, 0)) (corresponding to subgame perfect equilibrium (ci,G?oeo) in Ti) is 
not Nash equilibrium in rf WL . The second player can gain by choosing, for example, 
(0 2 , ft, 03, ft) = (71", it/2, 71, 0) instead of (0,0,0,0). Then she obtains the fair devision 
payoff. Moreover, for any other strategy of the first player (81, Pi), player 2 can select, 
for instance, (0, 0, 0, 0) to obtain a payoff being a mixture of Uf and u n . This proves 
that the unfair division u n cannot be a result in (fl3|) . The fair division m in turn can 
be achieved through continuum of Nash equilibria. Let us denote by NE(rf WL ) the set 
of all Nash equilibria of rf WL . An examination of (Tl9|) shows that: 



((tt, ft), (tt, 02, vr, ft)) : ft + ft < ^ , ^ ft = | J C NE(rf WL ) 



(23) 



as well as 



{((0, ft), (0, ft, vr, 0)) : ft, ft e [0, ~] } C NE(Ff WL ). (24) 

Moreover, all strategy profiles of these sets generate the payoff vector m for any division 
factor 1/2 < 5 < 1. To prove inclusion (J23"]) let us consider any strategy (9[, (3[) of player 
1 given that player 2's strategy from ( 1231) is fixed. Then for ft + ft < vr/4 we have 



J E 1 ((0 / 1 ,ft),(vr,ft,7r,ft)) 



X -€ sin 2 I sin 2 (ft + ft + ft ) + 6€ cos 2 | sin 2 (ft + ft 



(25) 



Since ft + ft < 7r/4, the maximum value of (|25p is achieved if the second element of the 
sum is 0. It implies that the best response of player 1 is 9[ = n and ft = n/2 — ft — ft. 
The second player cannot gain by deviating as well because she always obtains no 
more than €/2 in rf WL . Therefore, each profile of set ( |23|) indeed constitutes Nash 
equilibrium. Inclusion (124"]) can be proved in similar way. Notice that there are also 
Nash equilibria different from (12"3"]) and ( I24p that generate the payoff outcome €!/2 for 
both players. For example, a strategy profile {{Tt, vr/4), (n, it /A, it/2, 0)). 

Intuitively, a huge number of fair solutions in rf WL being NE together with a lack 
of an equilibrium outcome u u favors the second player in comparison to the classi- 
cal game Ti . However, it does not assure the second player the fair payoff €/2 
yet. Since the players choose their strategies simultaneously, they cannot coordinate 
them. If the first player unilaterally deviates from a strategy dictated by (124]) and 
she plays a strategy being a part of ( |23i) then both players receive nothing as we have 
£a,2((7T, ft), (0, ft, Tt, 0)) = for all ft, ft G [0,7r/2]. On the other hand, it turns out 
that the statement that each of these equilibria is equally likely to occur is not true. 
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Let us investigate which equilibria in Ti are preserved in Ff bearing in mind that 
the unitary strategies U(9, 0) are quantum counterparts to classical moves in F±. As we 
have seen there is no equilibrium profile in rf WL that allows the first player to gain <5€. 
Therefore, in particular, the unfair division equilibrium (ci, d eo) of I\ cannot be gener- 
ated by any unitary operations U(8, 0). However, each fair division equilibrium (pure or 
mixed) of (T5]) can be reconstructed in (1131) . The profile ((0, 0), (0, 0, tt, 0)) corresponding 
to the equilibrium (co, cfoei) in Ti is Nash equilibrium of rf WL since it is element of the 
set (jUj) . Next, the mixed equilibria mentioned in Section [3] can be implemented in rf WL 
as follows: they are the profiles where the first player chooses U (0, 0) and the second 
player chooses either [7(0,0)0*7(0,0) with probability p G [0, 1/25] and [7(0, 0)®U(ir, 0) 
with probability 1 — p, or in a language of behavioral strategies she just takes an op- 
erator from {[/(0, 0) <g> U(9, 0) : 9 G [2 arccos(l/v / 25), vr/2] }. According to the concept 
of Schelling Point [12] players tend to select a solution that is the most natural as well 
as the most distinctive among all possible choices. Therefore, if we assume that the 
players prefer the fair division, they choose a profile that is an equilibrium of both 1^ 
and rf WL among all equal equilibria of Ff WL . Since all these shared equilibria generate 
the same outcome, the pure equilibrium is the most natural and it ought to be chosen 
as the Schelling Point. 



6 Extensive form of the quantum ultimatum game 

In subsection 15. II we made observation that an extensive game and its quantum realiza- 
tion differ not only in utilities but also in game trees. Now, we are going to give the 
answer to the question how would a game tree of such quantum realization look like? 
Let us reconsider an extensive game form given by the game tree on Figure QJi, where the 
components H, P and Zj are derived from F\, and the outcomes Ooo, Oqi, Oiq and On, 
are assigned to the terminal histories (co, do), (cq, di), (ci, eo) and (ci,ei), respectively, 
instead of particular payoff values. Let us denote this problem as: 

r 2 = ({1,2}, H, P,li,0). (26) 

Then the tuple r^ w associated with F 2 is derived from r^ w and only the payoff func- 
tionals undergo appropriate modifications. Let us write r^ w in the language of 
density matrices, for convenience. That is: 

= (J1>, {1, 2}, Pin , £, {a , a x } h X) , (27) 

where 

• p in is a density matrix of the initial state (JTJ) ; 

• the outcome operator A is a sum of A + A 1 defined as: 

A = O 00 |00)(00| ® 1 + O i|01)(01| ® 1; 

A 1 = Oio|l)(l|®l®|0)(0|+O n |l)(l|®l®|l)(l|. 1 } 

In this case, the density matrix p nn of the final state takes a form 

Pfin = < ® < ® <p in < ® a 2 K2 ® oj,. (29) 
The outcome functionals §J5§ are then equivalent to the following one: 

E «,«,<)) =tr(Ap fin ). (30) 
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In order to give a extensive form to determine the final state pg n in I™ let us modify 
the way (129]) of calculating the final state pfi n . To begin with, player 1 acts on the first 
qubit. Next, player 2 carries out a measurement on that qubit in the computational 
basis to find out what is a current state of the game. Then she performs an operation 
on either the second or the third qubit of the post-measurement state depending on 
whether the measurement outcome or 1 has occurred. The operation of the second 
player ultimately defines the final state that is inserted to the formula f )30|) . The proce- 
dure can be formalized as follows: 



Sequential procedure 

1. (J \ 1 P'mO'\ 1 = P«i the player 1 performs an operation a 1 Kl on her qubit of 

the initial state p ln 

M L p Kl M L 

2. — t— — = Pm,Li the player 2 prepares the measurement {Mo, Mi} de- 

tr(M L p Kl ) ' fined by M t = |<,)(i|® 7 ® /, t = 0,1 on the first qubit of 

Pkx,l = tr(iW t p Kl)t ) the state a Kl p- m a Kl (the probability of obtaining result 

i is denoted by p Kl>l ) 

3. Si^Kit^Ka+^Kij^wa+t = ^f™ ^ a measuremen t outcome i occurs, the player 2 per- 

forms an operaton er Kt+2 on t + 2 qubit of the post- 
measurement state 



It turns out that for any strategy profile (o"* , (o^, )) the final state pfi n defined both 
by the formula fl29l) and by the sequential procedure determine the same outcome of the 
game T^ w . 

Proof. Let density operator p in of a state (j7j) be given. Then the state pg n after the 
third step of procedure can be expressed as: 

PL = <M p Kl M < + a^M^M.al 

= M a 2 K2 p K1 a 2 K2 M + M 1 a 3 K3 p K1 a 3 K3 M 1 . (31) 

Since X K M L = 5 KL X K , where 5 Ki is the Kronecker's delta, we obtain: 

tr(Xp^ n ) = tr(X°<p Kl < + lV^ Kl ay. (32) 

Notice that operation <j\ on the second (third) qubit of any state (JZj) does not influence 
the measurement of outcomes O\o and On (Ooo and Oqx), because of the form of X 1 
(X°), which means that: 

tr(XV^p Kl a^J=tr(X^ 2 ®^ 3 p K1 ^ 2 ®^ 3 ) for . = 0,1. (33) 

Inserting (I3"3l into the formula (J3"2"j) we get: 



tr (Xp' Rn ) = tr (X° + X 1 ) (g) aio in (g) ^ ? . (34) 




The right-hand side of f[3~4"|) is equal the expected outcome given by formula f[3"0"j) . Thus, 
the two ways of determining the final state are outcome-equivalent. □ 

We claim that performing quantum measurement is a more natural manner to play 
quantum games than observation of player's actions taken previously - the way suggested 
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'o.oo ^O.Ol ^0.10 "0.11 




Figure 3: The extensive game associated with the quantum realization Ttf- . 

by games played classically. Since the result of a quantum game is determined by the 
measurement outcome of the final state instead of actions taken by players, each stage of 
the quantum game also ought to be set via a quantum measurement of a current state. 
Moreover, when we suppose the second player's move dependence on actions of the first 
player in r 2 then it implies the same game tree as in Figure. [T^,). This way, however, 
stands in contradiction to the results in subsection 15.11 that tell us that the game trees 
must be different. Of course, if the initial state is 1 000) (000 1 (i.e., when game given by 
(I27j) boils down to a game fl2T)j) ). observation of the course of the game played classically 
and with the use of quantum measurement coincide. 

Let us study what a game tree corresponding to the game r^ w is yielded by the 
above-mentioned procedure. According to the first step, the initial history is followed 
by two actions of the first player. Next, the measurement on the first qubit is made. 
The two possible measurement outcomes i — 0, 1 can be identified with two actions 
(following each player l's move) of a chance mover that are taken with probability p Kl>L . 
Finally the player 2 acts on i + 2 qubit of the state p L after each history associated 
with the outcome l. Therefore, all histories followed by given outcome i constitutes an 
information set of player 2. Such description in a form of a game tree is illustrated in 
Figure EJ The outcomes O' K2 and 0[ K3 are determined by the following equations: 

°o.ici,«a = tr ( Xcr « 2 PM,o^ K2 ), 0' lKlK3 = tr(Xa K3 p Kl)1 a K3 ). (35) 

We have proved that the two approaches: ( I2"§j) and the sequential one to calculate 
the final state are outcome-equivalent. Therefore, it should be expected that extensive 
forms of r 2 and r^ w coincide when the initial state is a basis state. In fact, given 
Pin = 1 000) (000 1 the probabilities p Kljl are expressed by the formula p Kl)t = 5 Kul , where 
Ki, l G {0, 1}. Then, the available outcomes given by Eq. (135]) are as follows: O' Q0 = Ooo, 
O'om = Oqi, O' 110 = Oio and 0[ n = O n . By identifying a l Ki := c Kl , a 2 K2 := d K2 , 
cr? := e K3 the extensive game in Figure |3] represents game T 2 . 
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7 Conclusion 



We have shown that our proposal extends the ultimatum game in the quantum area. 
Although proposed scheme is suitable only for a normal representation of the ultimatum 
game in which some features of corresponding game in extensive form are lost, it passes 
on valuable information about how passing to the quantum domain influences a course 
of extensive games. The dominant position of player 1, when the ultimatum game 
is played classically, can be weakened in the case of playing the game via both the 
MW approach and the EWL approach. Another thing worth noting is that the the 
quantization significantly extends the game tree compared with classical case. It makes 
the normal representation to be more convenient way to analyze the game than the way 
of extensive form. 
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